Stacked Deformable Part Model with Shape Regression for Object Part Localization
نویسندگان
چکیده
This paper explores the localization of pre-defined semantic object parts, which is much more challenging than traditional object detection and very important for applications such as face recognition, HCI and fine-grained object recognition. To address this problem, we make two critical improvements over the widely used deformable part model (DPM). The first is that we use appearance based shape regression to globally estimate the anchor location of each part and then locally refine each part according to the estimated anchor location under the constraint of DPM. The DPM with shape regression (SR-DPM) is more flexible than the traditional DPM by relaxing the fixed anchor location of each part. It enjoys the efficient dynamic programming inference as traditional DPM and can be discriminatively trained via a coordinate descent procedure. The second is that we propose to stack multiple SR-DPMs, where each layer uses the output of previous SR-DPM as the input to progressively refine the result. It provides an analogy to deep neural network while benefiting from hand-crafted feature and model. The proposed methods are applied to human pose estimation, face alignment and general object part localization tasks and achieve state-of-the-art
منابع مشابه
Analysis and Synthesis of Facial Expressions by Feature-Points Tracking and Deformable Model
Face expression recognition is useful for designing new interactive devices offering the possibility of new ways for human to interact with computer systems. In this paper we develop a facial expressions analysis and synthesis system. The analysis part of the system is based on the facial features extracted from facial feature points (FFP) in frontal image sequences. Selected facial feature poi...
متن کاملHierarchical vibrations for part-based recognition of complex objects
We propose a technique for localization of complex shapes in images using a novel part–based deformable shape representation based on finite element vibration modes. Here, our method gives an extension for Finite Element Models to represent elastic co–variations of discrete variable shapes. It avoids misregistration by resolving several drawbacks inherent to standard shape–based approaches, whi...
متن کاملDeformable Part-based Fully Convolutional Network for Object Detection
Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, and simultane...
متن کاملClass-specific 3D localization using constellations of object parts
We address the problem of learning class-specific, deformable, 3D part-based structure for object localization, on fully connected (FC) partbased graphs, along the lines of Pictorial Structures (PS) [1], Constellation Models [2] and ISM [3]. Further the above object part localization is used to find dense correspondences between class 3D models. Our results show improvement over state of the ar...
متن کاملInferring 2D Object Structure from the Deformation of Apparent Contours
We present a new integrated approach to the two-dimensional part segmentation, shape and motion estimation of moving multi-part objects. Our technique exploits the relationship between the geometry and the observed deformations of the apparent contour of a moving multi-part object and its structure. The novelty of the technique is that no prior model of the object or of its parts is employed. W...
متن کامل